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Adaptively adjusting system parameters including bandwidth, transmit power and mode to maximize tlie "Bits 
per-Joule" energy efficiency (BPJ-EE) in the downlink MIMO systems with imperfect channel state information 
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^\ ' at the transmitter (CSIT) is considered in this paper. By mode we refer to choice of transmission schemes i.e. 

^SJ ' user number. We derive optimal bandwidth and transmit power for each dedicated mode at first, in which accurate 

[~^ ■ capacity estimation strategies are proposed to cope with the imperfect CSIT caused capacity prediction problem. 



singular value decomposition (SVD) or block diagonalization (BD), active transmit/receive antenna number and active 



Then, an ergodic capacity based mode switching strategy is proposed to further improve the BPJ-EE, which provides 
insights on the preferred mode under given scenarios. Mode switching compromises different power parts, exploits the 
tradeoff between the multiplexing gain and the imperfect CSIT caused inter-user interference, improves the BPJ-EE 
significantly. 

Index Terms 

"Bits per-Joule" energy efficiency (BPJ-EE), downlink MIMO systems, singular value decomposition (SVD), 
block diagonalization (BD), imperfect CSIT. 

L Introduction 

Energy efficiency is becoming increasingly important for the future radio access networks due to the climate 
change and the operator's increasing operational cost. As base stations (BSs) take the main parts of the energy 
consumption |[T|, ||2l, improving the energy efficiency of BS is significant. Additionally, multiple-input multiple- 
output (MIMO) has become the key technology in the next generation broadband wireless networks such as WiMAX 
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and 3GPP-LTE. Therefore, we will focus on the maximizing energy efficiency problem in the downlink MIMO 
systems in this paper 

Previous works mainly focused on maximizing energy efficiency in the single-input single output (SISO) systems 
ID-llTl and point to point single user (SU) MIMO systems lU-IlOl. In the uphnk TDMA SISO channels, the optimal 
transmission rate was derived for energy saving in the non-real time sessions O. G. W. Miao et al. considered the 
optimal rate and resource allocation problem in OFDMA SISO channels H-lISl. The basic idea of Il3l-|l6| is finding 
an optimal transmission rate to compromise the power amplifier (PA) power which is proportional to the transmit 
power and the circuit power which is independent of the transmit power. S. Zhang et al. extended the energy 
efficiency problem to a bandwidth variable system Q and the bandwidth-power-energy efficiency relations were 
investigated. As the MIMO systems can improve the data rates compared with SISO/SIMO, the transmit power can 
be reduced under the same rate. Meanwhile, MIMO systems consume higher circuit power than SISO/SIMO due to 
the multiplicity of associated circuits such as mixers, synthesizers, digital-to-analog converters (DAC), filters, etc. m 
is the pioneering work in this area comparing the energy efficiency of Alamouti MIMO systems with 2 antennas and 
SIMO systems in the sensor networks. H. Kim et al. presented the energy efficient mode switching between SIMO 
and 2 antenna MIMO systems |9|. A more general link adaptation strategy was proposed in ifTOll and the system 
parameters including number of data streams, number of transmit/receive antennas, use of spatial multiplexing or 
space time block coding (STBC), bandwidth etc. were controlled to maximize the energy efficiency. However, to 
the best of our knowledge, there are few literatures considering energy efficiency of the downlink multiuser (MU) 
MIMO systems. 

The number of transmit antennas at BS is always larger than the number of receive antennas at the mobile station 
(MS) side because of the MS's size limitation. MU-MIMO systems can provide higher data rates than SU-MIMO 
by transmitting to multiple MSs simultaneously over the same spectrum. Previous literatures mainly focused on 
maximizing the spectral efficiency of MU-MIMO systems, some examples of which are ifTTl - lfTSl . Although not 
capacity achieving, block diagonaUzation (BD) is a popular linear precoding scheme in the MU-MIMO systems 
lfTn - lfT4l . Performing precoding requires the channel state information at the transmitter(CSIT) and the accuracy 
of CSIT impacts the performance significantly. The imperfect CSIT will cause inter-user interference and the 
spectral efficiency will decrease seriously. In order to compromise the spatial multiplexing gain and the inter-user 
interference, spectral efficient mode switching between SU-MIMO and MU-MIMO was presented in ifTSl - lfTSl . 

Maximizing the "Bits per- Joule" energy efficiency (BPJ-EE) in the downlink MIMO systems with imperfect CSIT 
is addressed in this paper A three part power consumption model is considered. By power conversion (PC) power 
we refer to power consumption proportional to the transmit power, which captures the effect of PA, feeder loss, and 
extra loss in transmission related cooling. By static power we refer to the power consumption which is assumed to 
be constant irrespective of the transmit power, number of transmit antennas and bandwidth. By dynamic power we 
refer to the power consumption including the circuit power, signal processing power, etc. and it is assumed to be 
irrespective of the transmit power but dependent of the number of transmit antennas and bandwidth. We divide the 
dynamic power into three parts. The first part "Dyn-I" is proportional to the transmit antenna number only, which 
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can be viewed as the circuit power. The second part "Dyn-II" is proportional to the bandwidth only, and the third 
part "Dyn-III" is proportional to the multiplication of the bandwidth and transmit antenna number "Dyn-lI" and 
"Dyn-III" can be viewed as the signal processing power etc.. Interestingly, there are two main tradeoffs here. For 
one thing, more transmit antennas would increase the spatial multiplexing and diversity gain which leads to transmit 
power saving, while more transmit antennas would increase "Dyn-I" and "Dyn-III" which leads to dynamic power 
wasting. For another, multiplexing more active users with higher multiplexing gain would increase the inter-user 
interference, in which the multiplexing gain makes transmit power saving but inter-user interference induces transmit 
power wasting. In order to maximize BPJ-EE, the tradeoff among PC power, static power and dynamic power needs 
to be resolved and the tradeoff between the multiplexing gain and imperfect CSIT caused inter-user interference 
also needs to be carefully studied. The optimal adaptation which adaptively adjusts system parameters such as 
bandwidth, transmit power, use of singular value decomposition (SVD) or BD, number of active transmit/receive 
antennas, number of active users is considered in this paper to meet the challenge. 

Contributions. By mode we refer to choice of transmission schemes i.e. SVD or BD, active transmit/receive 
antenna number and active user number For each dedicated mode, we prove that the BPJ-EE is monotonically 
increasing as a function of bandwidth under the optimal transmit power without maximum power constraint. 
Meanwhile, we derive the unique globally optimal transmit power with a constant bandwidth. Therefore, the optimal 
bandwidth is chosen to use the whole available bandwidth and the optimal transmit power can be correspondingly 
obtained. However, due to imperfect CSIT, it is emphasized that the capacity prediction is a big challenge during the 
above derivation. To cope with this problem, a capacity estimation mechanism is presented and accurate capacity 
estimation strategies are proposed. 

The derivation of the optimal transmit power and bandwidth reveals the relationship between the BPJ-EE and the 
mode. Applying the derived optimal transmit power and bandwidth, mode switching is addressed then to choose 
the optimal mode. An ergodic capacity based mode switching algorithm is proposed. We derive the accurate close- 
form capacity approximation for each mode under imperfect CSIT at first and calculate the optimal BPJ-EE of 
each mode based on the approximation. Then, the preferred mode can be decided after comparison. The proposed 
mode switching scheme provides guidance on the preferred mode under given scenarios and can be applied offline. 
Simulation results show that the mode switching improves the BPJ-EE significantly and it is promising for the 
energy efficient transmission. 

The rest of the paper is organized as follows. Section II introduces the system model, power model and two 
transmission schemes and then section III gives the problem definition. Optimal bandwidth, transmit power derivation 
for each dedicated mode and capacity estimation under imperfect CSIT are presented in section IV. The ergodic 
capacity based mode switching is proposed in section V. The simulation results are shown in section VI and finally, 
section VII concludes this paper. 

Regarding the notation, bold face letters refer to vectors (lower case) or matrices (upper case). Notation E(A) 
and Tr(A) denote the expectation and trace operation of matrix A, respectively. The superscript H and T represent 
the conjugate transpose and transpose operation, respectively. 
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II. Preliminaries 

A. System model 

The downlink MIMO systems consist of a single BS with M antennas and K users each with N antennas. 
M > K X N IS, assumed. We assume that the channel matrix from the BS to the /cth user at time n is Hfc[n] e 
^NxM ^ fc = 1, . . . , ii', which can be denoted as 

Hfe[n] = CfeHfcH = $d^^*HfeH. (1) 

C,k — '^dk ^ is '^he large scale fading including pathloss and shadowing fading, in which d^, A denote the distance 
from the BS to the user k and the path loss exponent, respectively. The random variable ^ accounts for the 
shadowing process. The term $ denotes the pathloss parameter to further adapt the model which accounts for the 
BS and MS antenna heights, carrier frequency, propagation conditions and reference distance. Hfc[?^] denotes the 
small scale fading channel. We assume that the channel experiences flat fading and Hj,[n] is well modeled as a 
spatially white Gaussian channel, with each entry CN{Q, 1). 
For the fcth user, the received signal can be denoted as 

Yfe ["-] = Hfe [n] X [n] + rife [n] , (2) 

in which x[n] G ^Afxi j^ jjjg gi^'^ transmitted signal, nfe[7i] is the Gaussian noise vector with entries distributed 
according to CJ\f{0, NqW), where iVo is the noise power density and W is the carrier bandwidth. The design of 
x[n] depends on the transmission schemes which would be introduced in subsection III-CI 

As one objective of this paper is to study the impact of imperfect CSIT, we will assume perfect channel state 
information at the receive (CSIR) and imperfect CSIT here. CSIT is always get through feedback from the MSs in 
the FDD systems and through uplink channel estimation based on uplink-downlink reciprocity in the TDD systems, 
so the main sources of CSIT imperfection come from channel estimation error, delay and feedback error ifTSl - llTTl . 
Only the delayed CSIT imperfection is considered in this paper, but note that the delayed CSIT model can be simply 
extended to other imperfect CSIT case such as estimation error and analog feedback |fT5l , |fT6l . The channels will 
stay constant for a symbol duration and change from symbol to symbol according to a stationary correlation model. 
Assume that there is D symbols delay between the estimated channel and the downlink channel. The current channel 
Hfe[n] = CfeHfe[n] and its delayed version Hfe[7i — D] = CfeHfe[n — D] are jointly Gaussian with zero mean and are 
related in the following manner |fT6ll . 

Uk[n]^Pkilk[n-D] + Ek[n], (3) 

where pk denotes the correlation coefficient of each user, Efe[?7] is the channel error matrix, with i.i.d. entries 
CAf{Q, Eg fe) and it is uncorrected with Hfe[n — D]. Meanwhile, we denote Efc[n] = CfeEfe[n]. The amount of delay 
is T = DTs, where T, is the symbol duration. And pk = Jo(27r/d fer) with Doppler spread fd.k, where Jo(-) is the 
zeroth order Bessel function of the first kind, and Eg fe = 1 — Pfc HM- Therefore, both pk and ec,k are determined 
by the normalized Doppler frequency fd,kT- 
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B. Power model 

Apart from PA power and the circuit power, the signal processing, power supply and air condition power should 
also be taken into account at the BS |fT9l . Before introduction, assume the number of active transmit antennas is 
Ma and the total transmit power is Pt. Motivated by the power model in Q, ifTol . lfT9l . the three part power model 
is introduced as follows. The total power consumption at BS is divided into three parts. The first part is the PC 
power 

Ppc - f , (4) 

in which rj is the power conversion efficiency, accounting for the PA efficiency, feeder loss, and extra loss in 
transmission related cooling. Although the total transmit power should be varied as Ma and W changes, we study 
the total transmit power as a whole and the PC power includes all the total transmit power The effect of M^ and W 
on the transmit power independent power is expressed by the second part: the dynamic power PDyn- Poyn captures 
the effect of signal processing, circuit power, etc., which is dependent of A/a and W but independent of Pf ft>yn 
is separated into three classes. The first class "Dyn-I" PDyn-i is proportional to the transmit antenna number only, 
which can be viewed as the circuit power of the RF. The second part "Dyn-II" PDyn-ii is proportional to the 
bandwidth only, and the third part "Dyn-III" PDyn-iii is proportional to the multiplication of the bandwidth and 
transmit antenna number PDyn-ii and PDyn-iii can be viewed as the signal processing related power. Thus, the 
dynamic power can be denoted as follows. 

ft>yn = Poyn-l + ft>yn-II + ft>yn-III, 
ft)yn-I — Mi^Pcir, 
ft)yn-II ~ Pac.hwW, 
ft>yn-III — MaPsp,hwW, 

The third part is the static power Psta which is independent of Pt, ^a and W, including the power consumption 
of cooling systems, power supply and so on. Combining the three parts, we have the total power consumption as 
follows. 

-Ptotal = PpC + PByn + Psta- (6) 

Although the above power model is simple and abstract, it captures the effect of the key parameters such as Pt, 
M^ and W and coincides with the previous literatures Q, ifTOl . |fT9l . Measuring the accurate power model for a 
dedicated BS is very important for the research of energy efficiency and the measuring may need careful field test, 
however, it is out of the scope here. 

Note that here we omit the power consumption at the user side, as the users' power consumption is negligible 
compared with the power consumption of BS. Although any BS power saving design should consider the impact 
to the users' power consumption, it is beyond the scope of this paper 
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C. Transmission Schemes 

SU-MIMO with SVD and MU-MIMO with BD are considered in this paper as the transmission schemes. We 
will introduce them in this subsection. 

1) SU-MIMO with SVD: Before discussion, we assume that A/a transmit antennas are active in the SU-MIMO. 
As more active receive antennas result in transmit power saving due to higher spatial multiplexing and diversity 
gain, N antennas should be all active at the MS sidqj. The number of data streams is limited by the minimum 
number of transmit and receive antennas, which is denoted as Ng — min(i\/a, N). 

In the SU-MIMO mode, SVD with equal power allocation is applied. Although SVD with waterfilling is the 
capacity optimal scheme ||20| . considering equal power allocation here helps the comparison between SU-MIMO 
and MU-MIMO fairly ^. The SVD of H[7i] is denoted as 

H[n] = U[n]A[n]V[n]^, (7) 

in which A[n] is a diagonal matrix, U[n] and V[n] are unitary. The precoding matrix is designed as V[n] at the 
transmitter in the perfect CSIT scenario. However, when only the delayed CSIT is available at the BS, the precoding 
matrix is based on the delayed version which should be V[?t. — Z?]. After the MS preforms MIMO detection, the 
achievable capacity can be denoted as 

Rg{M,,P,,W) ^W J: log (l + j^^X^), (8) 

where Ai is the ith singular value of H[n]V[n — D]. 



i—1 ^ 



2) MU-MIMO with BD: We assume that K^ users each with Na,i, i = I, . . . , K^ antennas are active at the same 

time. Denote the total receive antenna number as iVa — ^ -/Va.i- As linear precoding is preformed, we have that 

1=1 
M^ > iVa im, and then the number of data streams is N^ ~ N^,. The BD precoding scheme with equal power 

allocation is applied in the MU-MIMO mode. Assume that the precoding matrix for the kth user is Tfc [n] and the 

desired data for the fcth user is Sk[n\, then x[n] = ^ Ti[?^]s4?^]. The transmission model is 

j=i 

Yk [n] = Hfe [n] ^ T, [n] s, [n] + rife [n] . (9) 

i=l 

In the perfect CSIT case, the precoding matrix is based on Hfe[n] ^ Ti[n] = 0. The detail of the design can 

i—l.i^k 

be found in ifTTJI . Define the effective channel as Hoff fe[n] = Hfe[n]Tfe[n]. Then the capacity can be denoted as 

<(Ma, i^a, iVa,i, . . . , iVa,Ka, ^t, M/) = 

K. / X (10) 

W E logdet (l+ ^^^Heff^feMHfff_,M) . 



k=l 



^{D)\ 



In the delayed CSIT case, the precoding matrix design is based on the delayed version, i.e. Hfe [n— Z?] ^ T,- [n] 

i—l.i^k 

0. Then define the effective channel in the delayed CSIT case as Hcff,fcb^] = Hfe[ri]TJ^ '[n]. The capacity can be 

' Here more receive antenna at MS will cause higher MS power consumption. However, note that the power consumption of MS is omitted. 
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denoted as lfT6l 



in which 



i?^ (Ma, i^a, iVa,l , . . . , ^a,K. , Pt , W^) = 

W § logdet (l+ f Heff.,NHfs^Jn]R-i[ '' 



-'(^)r^iT'(^)^r 






Ef[n]+7VoH/I (12) 



Rfc[n] = ^Efe[n] 
is the inter-user interference plus noise part. 

III. Problem Definition 

The objective of this paper is to maximize the BPJ-EE in the downlink MIMO systems. The BPJ-EE is defined 
as the achievable capacity divided by the total power consumption, which is also the transmitted bits per unit energy 
(Bits/Joule). Denote the BPJ-EE as ^ and then the optimization problem can be denoted as 

^ -Ttotal 

s.t. Ptx > 0, (13) 

According to the above problem, bandwidth limitation is considered. In order to make the transmission most energy 
efficient, we should adaptively adjust the following system parameters: transmission scheme m e {s, b}, i.e. use 
of SVD or BD, number of active transmit antennas M^, number of active users K;^, number of receive antennas 
^a,j, i — It ■ . , A'a, transmit power Pt and bandwidth W . 

The optimization of problem ( fT3l ) is divided into two steps. At first, determine the optimal Pt and W for each 
dedicated mode. After that, apply mode switching to determine the optimal mode, i.e. optimal transmission scheme 
m, optimal transmit antenna number Ma, optimal user number A'a and optimal receive antenna number N^^i, 
according to the derivations of the first step. The next two sections will describe the details. 

IV. Maximizing Energy Efficiency with Optimal Bandwidth and Transmit Power 

The optimal bandwidth and transmit power are derived in this section under a dedicated mode. Unless otherwise 
specified, the mode, i.e. transmission scheme m, active transmit antenna number Afa, active receive antenna number 
Na„i,i — 1, . . . , ATa and active user number /fa, is constant in this section. The following lemma is introduced at 
first to help the derivation. 

Lemma 1: For optimization problem 

max ^ ' 



S.t. x> 
in which a > and & > 0. f{x) > (a; > 0) and f{x) is strictly concave and monotonically increasing. There 
exists a unique globally optimal x* given by 

x*^Iifl b (15) 

where / (x) is the first derivative of function f{x). 
Proof: See Appendix lAl 
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A. Optimal Energy Efficient Bandwidth 

To illustrate the effect of bandwidth on the BPJ-EE, the following theorem is derived. 
Theorem 1: Under constant Pt, there exists a unique globally optimal W* given by 

TI^* ^ (PpC + Pst., + M^Poi,) + (MaP„p,bw+Pac.bw)fl(M^*) (-IfC^, 

(M,p,p,bw + Pao,bw)fl'(W-) ^ "'' 

to maximize ^, in which R{W) denotes the achievable capacity with a dedicated mode. If the transmit power scales 
as Pt = PtW , £, is monotonically increasing as a function of W. 

Proof: See Appendix |B] 

This Theorem provides helpful insights about the system configuration. When the transmit power of BS is fixed, 
configuring the optimal bandwidth helps improve the energy efficiency. Meanwhile, if the transmit power can 
increase proportionally as a function of bandwidth based on Pt = ptW, transmitting over the whole available 
spectrum is thus the optimal energy efficient transmission strategy. As P^ can be adjusted in problem ( fT3] l and 
no maximum transmit power constraint is considered there, choosing W* = W^ax as the optimal bandwidth can 
maximize ^. Therefore, W* = Wmax is applied in the rest of this paper. 

One may argue that the transmit power is limited by the BS's maximum power in the real systems. In that case, 
W and Pt should be jointly optimized. We consider this problem in our another work ll23l . 

B. Optimal Energy Efficient Transmit Power 

After determining the optimal bandwidth, we should derive the optimal P^* under W* — Wmax- In this case, we 
denote the capacity as R{Pt) with the dedicated mode. Then the optimal transmit power is derived according to 
the following theorem. 

Theorem 2: There exists a unique globally optimal transmit power P^* of the BPJ-EE optimization problem given 

by 

^t* = f(^-'7(Psta + PDy„). (17) 

Proof: See Appendix |C] 

Therefore, the optimal bandwidth and transmit power are derived based on Theorem [T] and Theorem |2] That is 
to say, the optimal bandwidth is chosen as W* — VFmax and the optimal transmit power is derived according to 

dnj. 

However, note that during the optimal transmit power derivation ( fTTj i, the BS needs to know the achievable 
capacity based on the CSIT prior to the transmission. If perfect CSIT is available at BS, the capacity formula 
can be calculated at the BS directly according to dSll for SU-MIMO with SVD and ([TOll for MU-MIMO with BD. 
But if the CSIT is imperfect, the BS need to predict the capacity then. In order to meet the challenge, a capacity 
estimation mechanism with delayed version of CSIT is developed, which is the main concern of next subsection. 
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C. Capacity Estimation Under Imperfect CSIT 

1) SU-MIMO: SU-MIMO with SVD is relatively robust to the imperfect CSIT ^6\, using the delayed version 
of CSIT directly is a simple and direct way. Following proposition shows the capacity estimation of SVD mode. 

Proposition 1: The capacity estimation of SU-MIMO with SVD is directly estimated by: 

Rr = W^ E log (l + T^>^) , (18) 

where A^ is the singular value of ll[n — D]. 

Proposition [T] is motivated by II16I . In Proposition [T] when the receive antenna number is equal to or larger than 
the transmit antenna number, the degree of freedom can be fully utilized after the receiver's detection, and then the 
ergodic capacity of ( fTSl l would be the same as the delayed CSIT case in ([8]). When the receive antenna number 
is smaller than the transmit antenna number, although delayed CSIT would cause degree of freedom loss and ( fTsl ) 
cannot express the loss, the simulation will show that Proposition [T] is accurate enough to obtain the optimal ^ in 
that case. 

2) MU-MIMO: Since the imperfect CSIT leads to interuser interference in the MU-MIMO systems, simply 
using the delayed CSIT can not accurately estimate the capacity any longer. We should take the impact of interuser 
interference into account. J. Zhang et al. first considered the performance gap between the perfect CSIT case and 
the imperfect CSIT case lfT6l . which is described as the following lemma. 

Lemma 2: The rate loss of BD with the delayed CSIT is upper bounded by lfT6l : 



wf:7Va,felog2 
fc=l 






i—l^i^k 



(19) 



As the BS can get the statistic variance of the channel error e^ j, due to the doppler frequency estimation, the 
BS can obtain the upper bound gap Ai?jJ^'' through some simple calculation. According to Proposition [T] we can 
use the delayed CSIT to estimate the capacity with perfect CSIT R^ and we denote the estimated capacity with 
perfect CSIT as 

7-)CSt,P 

K^ (20) 

W E^ log dct (I + i^^Heff., [n - i?]Hf^ , [n-D]), 

in which Hcff,fe[n — I?] = tlkin — D]Tk[n — D]. Combining ( l20b and Lemma|2] a lower bound capacity estimation 
is denoted as the perfect case capacity R°^ ' minus the capacity upper bound gap Ai?J^^^, which can be denoted 
as (HU 

^cst-Zhang ^ ^cst,P _ ^^upp (21) 

However, this lower bound is not tight enough, a novel lower bound estimation and a novel upper bound estimation 
are proposed to estimate the capacity of MU-MIMO with BD. 

Proposition 2: The lower bound of the capacity estimation of MU-MIMO with BD is given by ( |22] |. while the 
upper bound of the capacity estimation of MU-MIMO with BD is given by (l23T l. The lower bound in ( |22] | is tighter 
than i?-t'Zha„g .^ ^ 
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^est,ow ^ ^ g i^g^^^ (J ^ j^./N. jjeff,fc[n - D]Hfff,,[n - D]) (22) 

fe=l NoW+ E JV,,iT^«^e,fc 



^est.upp = ly E <( 



■■» Wa ''e.fc 

(23) 



logdet (I + j^ Heff,fe[n - 2?]Hfff ,[n - D]) + {N^.u/M,) log^Ce) 



Proposition |2] is motivated by ll2n . It is illustrated as follows. Rewrite the transmission mode of user A: of (|9l) as 

Yk [n] = Hfe [n] Tfe [n] s^ [n] + Hfc [n] X) T^ [n] Si [n] + n^ [n] . (24) 

With delayed CSIT ,denote 
then A/i.[n] = B/i.[n]B^[?^] and the covariance matrix of the interference plus noise is then 



Kk[n]^-^Ak[n] + NoWI[n]. (25) 



The expectation of Rfc [n] is lfT6l 



-ffa 



E(Rfe[n])= E iVa,,^e2^^i + AroM^I (26) 

i—l^i^k 

Based on Proposition [T] we use Hoff.fe[n — D] with the delayed CSIT to replace the Hoff,feM in ( fTTT ). And 
then the capacity expression of each user is similar with the SU-MIMO channel with inter-stream interference. 
The capacity lower bound and upper bound with a point to point MIMO channel with channel estimation errors in 
II2TI is appUed here. Therefore, the lower bound estimation ( |22] | and upper bound estimation ( |23] ) can be verified 
according to the lower and upper bounds in II2TI and ( |26] |. 

We can get R*^ ' "^ — R'^ ' ^"^ > after some simple calculation, so R'^ ' °* is tighter than R'^ ' ^^'^e^ q 
According to Proposition [T] and Proposition |2] the capacity estimation for both SVD and BD can be performed. In 
order to apply Proposition [T| and Proposition |2] to derive the optimal bandwidth and transmit power, it is necessary 
to prove that the capacity estimation ( fTSl ) for SU-MIMO and ( |22] | ( |23] ) for MU-MIMO are all stricdy concave and 
monotonically increasing. At first, as R°^^ in (fTsl l is similar with Rs{Ma., Pt, W) in (H), the same property of strictly 
concave and monotonically increasing of (fTsT i is fulfilled. About (l22T l and ( |23] ). the proof of strictly concave and 
monotonically increasing is similar with the proof procedure in Theorem|2] If we denote gk^i > 0,i — I, . . . , A^a,fc 
as the eigenvalues of Heff,fe[7i — Z?]H^ /ci" ^ D], (l22l) and ( |23] ) can be rewritten as 

i?f .1- ^WE E iog(i + .y"' ^^ — gk,) 

k=l i=l NoW+ E ^a,i-^«^fc 
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and 



i?f^"PP = VF^ 



fc=l 



N, 



2^ log(l + gua) 



+ (^a,fe/Ma)l0g2(e) 



respectively. Calculating the first and second derivation of the above two equations, it can be proved that ( l22b and 
(l23T l are both strictly concave and monotonically increasing in Ft and W . Therefore, based on the estimations of 
Proposition [1] and Proposition |2] the optimal bandwidth and transmit power can be derived at the BS. 

V. Energy Efficient Mode Switching 

A. Mode Switching Based on Instant CSIT 

After getting the optimal bandwidth and transmit power for each dedicated mode, choosing the optimal mode 
with optimal transmission mode m*, optimal transmit antenna number Af*, optimal user number K^ each with 
optimal receive antenna number N* ^ is important to improve the energy efficiency. The mode switching procedure 
can be described as follows. 

Energy Efficient Mode Switching Procedure 

Step 1. For each transmission mode m with dedicated active transmit antenna number Mg., active user number 
Kg. and active receive antenna number iVa.i, calculate the optimal transmit power P^ and the corresponding BPJ-EE 
according to the bandwidth W* = Wmax and capacity estimation based on Proposition [T] and Proposition |2l 

Step 2. Choose the optimal transmission mode m* with optimal M*, K^ and iV*j with the maximum BPJ-EE. 
D 

The above procedure is based on the instant CSIT. As we know, there are two main schemes to choose the 
optimal mode in the spectral efficient multimode transmission systems. The one is based on the instant CSIT [:12j- 
lfT4l . while the other is based on the ergodic capacity ifTSJI - lflTl . The ergodic capacity based mode switching can 
be performed off line and can provide more guidance on the preferred mode under given scenarios. If applying 
the ergodic capacity of each mode in the energy efficient mode switching, similar benefits can be exploited. The 
next subsection will present the approximation of ergodic capacity and propose the ergodic capacity based mode 
switching. 

B. Mode Switching Based on the Ergodic Capacity 

Firstly, the ergodic capacity of each mode need to be developed. The following lemma gives the asymptotic result 
of the point to point MIMO channel with full CSIT when Ma > N^. 

Lemma 3: lfT6l . Il22ll For a point to point channel when M^ > N^,, denote /? = ^ and 7 = nw ' ^^^ capacity 
is approximated as 

i?appro^^Ci,o(/3,/37) (27) 
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in which Cjso is the asymptotic spectral efficiency of the point to point channel, and Ciso can be denoted as 






l + 7--F(/3,^) 



+/31og 



log2(e) 



+ ^-J-(/?,^) -/3^2I^J-(/3,2) 



(28) 



with 



T{x,y) 



^1 + y{l + V5)2 - ^1 + y(l - V^)2 



As SVD is applied in the SU-MIMO systems, the transmission is aligned with the maximum Ng singular vectors. 
When Afa < N^,, the achievable capacity approximation is modified as 



Rappr 



WCU^Jj), 



(29) 



where /3 = i = ^. 

Therefore, according Proposition [1] the following proposition can be get directly. 
Proposition 3: The ergodic capacity of SU-MIMO with SVD is estimated by: 

oErgodic oappro 



(30) 



Although Zhang et al. give another accurate approximation in lfT6ll for the MU-MIMO systems with BD, it is 

only applicable in the scenario in which ^ A^a.i = M^,. We develope the ergodic capacity estimation with BD 

1=1 
based on Proposition |2l 

As Ti;[n — D] is designed to null the interuser interference, it is a unitary matrix independent of Hfc[n — D]. 
So Hfe[n — D]Tk['n' — D] is also a zero-mean complex Gaussian matrix with dimension A^a,*; x Afa,fe, where 
Ma,fe — Ma — J2 ^a,j- The effective channel matrix of user k can be treated as a SU-MIMO channel with 
transmit antenna number Ma.fc and receive antenna number A'^a^fc. Combining Proposition [T] Proposition |2] and 
Proposition [3] we have the following Proposition. 

Proposition 4: The lower bound of the ergodic capacity estimation of MU-MIMO with BD is given by 



R 



Ergodic— low 



wY.Ciso0kJk%), 

k=l 



while the upper bound of the ergodic capacity estimation of MU-MIMO with BD is given by 



R 



Ergodic — upp 



where 



7fe 



Wt [Ciso(/3fe,/3fe7fc) + f log2(e) 

fe=l L '^'' 



fik = Ma^fe/A^a.fe, 
PtQk 



(31) 



(32) 






AoT^ + 

i^l.i^k 

For comparison, the ergodic capacity lower bound based on (1211 1 is also considered. As shown in ( fT9] l, the 
expectation can be denoted as 

E(it!^ -R^)< E(Ai?^PP). 
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As Ai?;jPP is a constant, we have E(Ai?^'PP) = Ai?;^PP, and then 

E«)-E(i?^)< A<PP. (33) 

Therefore, the lower bound estimation in i2T[ can also be applied to the ergodic capacity case. As the expectation 

of ( |20| | can be denoted as lfT6ll 

E(i?f '^) ^w'j: CUh, Pkl). (34) 

fc=l 

the low bound ergodic capacity estimation can be denoted as 

^Ergodic-Zhang ^ ^ g CUh.hl) " A<PP. (35) 

fe=l 

After getting the ergodic capacity of each mode, the ergodic capacity based mode switching algorithm can be 
summarized as follows. 

Ergodic Capacity Based Energy Efficient Mode Switching 

Step 1. For each transmission mode m with dedicated Ma, K^ and A^a,i, calculate the optimal transmit power 
P* and the corresponding BPJ-EE according to the bandwidth W* — Wmax and ergodic capacity estimation based 
on Proposition [3] and Proposition |4] 

Step 2. Choose the optimal m* with optimal AI^, K* and N*^ with the maximum BPJ-EE. D 

According to the ergodic capacity based mode switching scheme, the operation mode under dedicated scenarios 
can be determined in advance. Saving a lookup table at the BS according to the ergodic capacity based mode 
switching, the optimal mode can be chosen simply according to the application scenarios. The performance and the 
preferred mode in a given scenario will be shown in the next section. 

VI. Simulation Results 

This section provides the simulation results. In the simulation, M = 6, N = 2 and K = 3. All users are assumed 
to be homogeneous with the same distance and moving speed. Only pathloss is considered for the large scale fading 
model and the pathloss model is set as 128.1 + 37.6 logj^Q dk dB (dk in kilometers). Carrier frequency is set as 2GHz 
and D — 1ms. Noise density is A'o = — 174dBm/Hz. The power model is modified according to |fT9L which is set 
as 77 = 0.38, Pcir = 66.4W, Psta = 36.4W, ftp^bw = 3.32^W/Hz and pac.bw = 1.82^W/Hz. W^^ax = 5MHz. For 
simplification, "SU-MIMO (A/a,iVa)" denotes SU-MIMO mode with A/a active transmit antennas and iVa active 
receive antennas, "SIMO" denotes SU-MIMO mode with one active transmit antennas and N active receive antennas 
and "MU-MIMO {Ma,Na,K^)'" denotes MU-MIMO mode with A/a active transmit antennas and /Ca users each A^a 
active receive antennas. Seven transmission modes are considered in the simulation, i.e. SIMO, SU-MIMO (2,2), 
SU-MIMO (4,2), SU-MIMO (6,2), MU-MIMO (4,2,2), MU-MIMO (6,2,2), MU-MIMO (6,2,3). In the simulation, 
the solution of (fTStdTSl) and ( flTl l is derived by the Newton's method, as the close-form solution is difficult to get. 

Fig. [T] depicts the effect of capacity estimation on the optimal BPJ-EE under different moving speed. The optimal 
estimation means that the BS knows the channel error during calculating P* and the precoding is still based on 
the delayed CSIT. In the left figure, SU-MIMO is plotted. The performance of capacity estimation and the optimal 
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Fig. 1. The effect of capacity estimation on the energy efficiency of SU-MIMO and MU-MIMO under different speed. 
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Fig. 2. Comparison of energy efficiency based on ergodic capacity and instant capacity with SU-MIMO and MU-MIMO 

estimation are almost the same, which indicates that the capacity estimation of the SU-MIMO systems is robust 
to the delayed CSIT. Another observation is that the BPJ-EE is nearly constant as the moving speed is increasing 
for SIMO and SU-MIMO(2,2), while it is decreasing for SU-MIMO(4,2) and SU-MIMO(6,2). The reason can be 
illustrated as follows. The precoding at the BS can not completely align with the singular vectors of the channel 
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Fig. 3. Performance of mode switching 

matrix under the imperfect CSIT. But when the transmit antenna number is equal to or greater than the receive 
antenna number, the receiver can perform detection to get the whole channel matrix's degree of freedom. However, 
when the transmit antennas are less than the receive antenna, the receiver cannot get the whole degree of freedom 
only through detection, so the degree of freedom loss occurs. The center and right figures show us the effect of 
capacity estimation with MU-MIMO modes. The three estimation schemes all track the effect of imperfect CSIT. 
From the amplified sub-figures, the upper bound capacity estimation is the closest one to the optimal estimation. 
It indicates that the upper bound capacity estimation is the best one in the BD scheme. Moreover, we can see that 
BPJ-EE of the BD scheme decreases seriously due to the imperfect CSIT caused inter-user interference. 

Fig. |2] compares the BPJ-EE derived by ergodic capacity estimation schemes and the one by simulations. The 
left figure demonstrates the SU-MIMO modes. The estimation of SIMO, SU-MIMO(4,2) and SU-MIMO(6,2) is 
accurate when the moving speed is low. But when the speed is increasing, the ergodic capacity estimation of 
SU-MIMO(4,2) and SU-MIMO(6,2) can not track the decrease of BPJ-EE. There also exists a gap between the 
ergodic capacity estimation and the simulation in the SIMO mode. Although the mismatching exists, the ergodic 
capacity based mode switching can always match the optimal mode, which will be shown in the next figure. For the 
MU-MIMO modes, the two lower bound ergodic capacity estimation schemes mismatch the simulation more than 
the upper bound estimation scheme. That is because the lower bound estimations cause BPJ-EE decreasing twice. 
Firstly, the derived transmit power would mismatch the exactly accurate transmit power because the derivation is 
based on a bound and this transmit power mismatch will make the BPJ-EE decrease compared with the simulation. 
Secondly, the lower bound estimation use a lower bound formula to calculate the estimated BPJ-EE under the 
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derived transmit power which will make the BPJ-EE decrease again. Nevertheless, the upper bound estimation has 
the opposite impact on the BPJ-EE estimation during the above two steps, so it matches the simulation much better. 
According to Fig. [T] and Fig.|2] the upper bound estimation is the best estimation scheme for the MU-MIMO mode. 
Therefore, during the ergodic capacity based mode switching, the upper bound estimation is applied. 

Fig. [3] depicts the BPJ-EE performance of mode switching. For comparison, the optimal mode with instant CSIT 
('Optimal') is also plotted. The mode switching can improve the energy efficiency significantly and the ergodic 
capacity based mode switching can always track the optimal mode. The performance of ergodic capacity based 
switching is nearly the same as the optimal one. Through the simulation, the ergodic capacity based mode switching 
is a promising way to choose the most energy efficient transmission mode. 
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Fig. 4. Optimal mode under different scenario, o: SIMO, X : SU-MIMO (2,2), +: SU-MIMO (4,2),n: SU-MIMO (6,2),0: MU-MIMO (4,2,2), V: 
MU-MIMO (6,2,2),<]: MU-MIMO (6,2,3). 

Fig. |4] demonstrates the preferred transmission mode under given scenarios. The optimal mode under different 
moving speed and distance is depicted. This figure provides insights on the PC power/dynamic power/static power 
tradeoff and the multiplexing gain/inter-user interference compromise. When the moving speed is low, MU-MIMO 
modes are preferred and vice versa. This result is similar to the spectral efficient mode switching in ifTSJI - lfTSl . 
Inter-user interference is small when the moving speed is low, so higher multiplexing gain of MU-MIMO benefits. 
When the moving speed is high, the inter-user interference with MU-MIMO becomes significant, so SU-MIMO 
which can totally avoid the interference is preferred. Let us focus on the effect of distance on the mode under high 
moving speed case then. When distance is less than 1.7km, SU-MIMO (2,2) is the optimal one, while the distance 
is equal to 2.1 km and 2.5km, the SIMO mode is suggested. When the distance is larger than 2.5km, the active 
transmit antenna number increases as the distance increases. The reason of the preferred mode variation can be 
explained as follows. As the total power can be divided into PC power, transmit antenna number related power 
"Dyn-I" and "Dyn-III" and transmit antenna number independent power "Dyn-II" and static power The first and 
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third part divided by capacity would increase as the active number increases, while the second part is opposite. In 
the long distance scenario, the first part will dominates the total power and then more active antenna number is 
preferred. In the short and medium distance scenario, the second and third part dominate the total power and the 
tradeoff between the two parts should be met. Above all, the above mode switching trends of Fig. |4] externalize the 
two tradeoffs. 

VII. Conclusion 

This paper discusses the energy efficiency maximizing problem in the downUnk MIMO systems. The optimal 
bandwidth and transmit power are derived for each dedicated mode with constant system parameters, i.e. fixed trans- 
mission scheme, fixed active transmit/receive antenna number and fixed active user number. During the derivation, 
the capacity estimation mechanism is presented and several accurate capacity estimation strategies are proposed to 
predict the capacity with imperfect CSIT. Based on the optimal derivation, ergodic capacity based mode switching 
is proposed to choose the most energy efficient system parameters. This method is promising according to the 
simulation results and provides guidance on the preferred mode over given scenarios. 

Appendix A 
Proof of Lemma[T] 

Proof: The proof of the above lemma is motivated by ||4l. Denote the inverse function of y = f{x) as x = g{y), 
then X* — argmax^r ^Si ~ £irgmaxg(j,) ITTT+b' Denote y* ~ f{x*). Since f{x) is monotonically increasing, 
y* — argmaxj, (^)+f, - According to H, there exists a unique globally optimal y* given by 

y* = b+Mv') (36) 

if g{y) is strictly convex and monotonically increasing. ( [36b is fulfilled since the inverse function of g{y), i.e. f{x) 
is strictly concave and monotonically increasing. Taking g (y) = ^ and f{x) — y into ( l36b , we can get ( fTSb . 

Appendix B 
Proof of Theorem[T] 

Proof: The first part can be proved according to Lemma [T] Calculating the first and second derivation of R{W) 
based on ^, (fTOl i and ( fTTT i. we can see that R{W) of both SVD and BD mode is strictly concave and monotonically 
increasing as a function of W. The optimal W* can be get through (fTSl l, which is given by ( fTSI l. 

Look at the second part. Taking Ft = PtW into (|8]l, ( fTol i and (fTTT i. the capacity is -R(Pt, W) — WRmiPt), where 
R{pt) is independent of W . We have that 



c _ WR(pt) (-37-1 



The second part is verified. 
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Appendix C 
Proof of Theorem[2] 

Proof: According to Lemma [T] the above theorem can be verified if we prove that Rm{Pt) is strictly concave 
and monotonically increasing for both SVD and BD. It is obvious that the capacity of SVD and BD with perfect 
CSIT is strictly concave and monotonically increasing based on dD and ( fTol i. If the capacity of BD with imperfect 
CSIT can also be proved to be strictly concave and monotonically increasing. Theorem |2] can be proved. 



Denoting A^ = E/j[n] 



A', 



E^[rt], then rewrite (fTTI) as follows: 



RbiPt) ^WE {logdet (RfeM + ^Heff,feWHfff^,[n]) - logdetRfc[n]} 

= wt {logdet [l + ^^^ (Afc + Heff^feNHfff^febi])] - logdet (l + ^vi^fivjAfc) } 

= W^ E E |log(l + iv^cfe,) - log(l + T^^^gu,)] . 

k=l i=l '- J 



(38) 



Ck^i and gk.i are the eigenvalue of A^ + Hoff.fcMH^ ^.[n] and A^, respectively. Sorting Ck^i and gk^i as Ck.i > 
...> Ck,N.,,k and gks > ■■■> gk,N^,k- Since Afc and Hoff^fcMH^_j.[7i] are both positive definite, Ck,i > gk,i,i = 
1, . . . , -/Va,fe- Calculating the first and second derivation of (|38] |. (fTTl i is strictly concave and monotonically increasing. 
Then Theorem |2] is verified. 
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